DYX-016.0 US 



RINDING PRPTTnFS FOR CAR nNOFMRRYONir ANTIGEN (CEA) 



FTFT D OF TH F INVENTION 

The present invention relates to CEA binding polypeptides and compositions for 
detection and treatment of cancer. More particularly, the invention relates to materials 
useful for and methods of detecting, imaging, localizing, and targeting tumors exhibiting 
CEA. The invention provides binding polypeptides capable of associating specifically 
with CEA and of distinguishing between CEA and known cross-reactive antigens, such as 
NCA (non-specific cross-reacting antigen). Such binding polypeptides are useful for the 
detection, imaging, localization, and targeting of CEA-containing tissues or solutions, 
e.g., by radioimaging, magnetic resonance imaging, or x-ray imaging, and are also useful 
in the diagnosis and treatment of cancers associated with CEA. 
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RAricr.ROUN n OF THE INVENTION 

Carcinoembryonic antigen, or CEA, is a complex immunoreactive glycoprotein 
with a molecular weight of 1 80,000 found in adenocarcinomas of endodermally derived 
digestive system epithelia and fetal colon. Tumor cells at many sites, including colon, 
breast, lung, cervix, ovary, stomach, bladder, pancreas and esophagus express large 
amounts of carcinoembryonic antigen and/or the closely related immunoglobulin 
supergene family member, nonspecific cross-reactive antigen, or NCA, on their surfaces. 
The expression of these glycoproteins, especially CEA, in normal cells is very limited in 
mature individuals (as opposed to prenatal infants), and this antigen has been used as a 
target in immunoassays for diagnosis and for serially monitoring cancer patients for 
recurrent disease or response to therapy. (See, Mach et al., Immun. Today, 2: 239, 1981 ; 
Berche et al., Br. Med. J., 285: 1447, 1982.) Anti-CEA antibodies also have been 
proposed for cancer therapy and for use in forming immunoconjugates which in turn can 
be used in cancer therapy. (See, e.g., Buchegger et al., U.S. 5,047,507 (1991); Osbourne 

etal. U.S. 5,872,215 (1999).) 

CEA was as first described by Gold and Freedman, J. Exp. Med., 121 : 439, 1965, 
and has now been completely sequenced and characterized (see, Beauchemin et al., Mol. 
Cell. Biol., 7:3221-30, 1987; WO 95/06067). CEA has a domain structure of N-A1-B1- 
A2-B2-A3-B3-GPI where GPI is a glycophosphatidylinositol membrane anchor. A 
significant degree of sequence homology exists between the domains of CEA and other 
members of the immunoglobulin supergene family, and immunological cross-reactivity 
between CEA and as many as sixteen other homologous antigens, such as NCA and 
biliary glycoprotein- 1 (BGP-1), has been reported. 

One of the major drawbacks of the use of anti-CEA antibodies for clinical 
purposes has been the cross-reactivity of these antibodies with some apparently normal 
adult tissues. Previous studies have shown that most conventional hyperimmune antisera 
raised against different immunogenic forms of CEA cross-react with CEA-related 
antigens found in normal colonic mucosa, spleen, liver, lung, sweat glands, 
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polymorphonuclear leukocytes and monocytes of normal individuals, as well as many 

different types of carcinomas. 

Accordingly, there is a great need for binding moieties that bind to CEA but do 
not cross-react with other antigens such as NCA. This and other objects are 
omplished herein with the discovery of novel peptide binders of CEA. 
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SI , MM AP v OF THF INVENTION 

The present invention addresses the need for improved materials and methods for 
detecting, localizing, measuring and treating CEA-expressing cells by providing a group 
10 of non-naturally occurring polypeptides that bind specifically to CEA. Appropriate 
labeling of such polypeptides provides detectable imaging agents that bind at high 
concentration to a CEA-expressing tumor, providing excellent tumor-specific imaging 
agents. Conjugation or fusion of such polypeptides with effective agents such as 
cytokines, chemotherapeutic agents, radionuclides or other cancer therapeutics produce 
1 5 conjugates that can be used for cancer therapy, i.e., by causing the conjugate to target the 
site of a tumor that is producing CEA. Recombinant bacteriophage displaying the CEA- 
binding polypeptides of the invention have been identified and isolated, and such phage 
products are also valuable reagents for effective detection and diagnosis of cancers 
associated with the expression of CEA in cells and tissues. The CEA binding moieties of 
20 the instant invention can be used in the detection, diagnosis, and therapy of such CEA- 
related disorders. 

This invention pertains to CEA binding moieties. Binding moieties according to 
this invention are useful in any application where binding, detecting or isolating CEA or 
its fragments is advantageous. A particularly advantageous use of the binding moieties 

25 disclosed herein is in a method of imaging cells or tissues expressing CEA in vivo. The 
method entails the use of CEA specific binding moieties according to the invention for 
detecting CEA-expressing cells, where the binding moieties have been detectably labeled 
for use as imaging agents, including magnetic resonance imaging (MRI) contrast agents, 
x-ray imaging agents, radiopharmaceutical imaging agents, ultrasound imaging agents, 

30 and optical imaging agents. 
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Preferred CEA binding moieties according to the invention are isolated, synthetic 
polypeptides having a high affinity for CEA. Th.s invention provides a new class of 
CEA binding polypeptides having an amino acid sequence comprising: 

X 1 -X^X 3 -Cys^X 4 -X 5 -X 6 -X 7 -X s -X 9 -X, 0 X i: -CysOUX, X ;4 (SEQ ID NO:l). 

5 wherein 

X,isAsn, Asp, or is absent; 

X 2 is Trp; 

X 3 is Asp, Phe, or Val; 
X 4 is Asn, Glu, or Met; 
10 X 5 is Asn, Leu, Met or Phe; 

X () is Asp, Gly, He, Lys, Phe or Thr; 
X 7 is Ala, Gin, Gly, Lys, or Thr; 
X 8 is Arg, Asn, Asp, Glu, or Gly; 
X 9 is Gin, Gly, or Leu; 
15 X lu is Ala,TrporTyr; 

X, , is Ala, Gly, His, Phe, Thr, or Val; 
X, 2 is Asn, Gin, Phe, Ser or Val; 
X, 3 is Arg, Leu, Pro or Ser; and 
X 14 is Leu, Ser, Trp or Tyr; 
w and wherem said polypeptide has the ability to b,„d CEA. Said polypeptide may have 
add,.io„al amino aeids at.aehed a, either end. Peptides having a serine a. the N-.ermmus 
(before X x ) are preferred embodiments. 

Preferred CEA binding polypeptides of the above formula will have the amino 

acid sequence: 

25 XrTrp-Val-Cys-Glu-X^-Lys- X^ln-Trp- X u -Cys Asn-X 13 -X 14 (SEQ ID 

NO:2), wherein 

X, is Asn or Asp; 

X s is Asn, Leu, Met or Phe; 

X„ is Asp, Gly, He, Lys, Phe or Thr; 
30 X, is Arg, Asn, Asp, Glu, Gly or Trp; 
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X„ is Ala, Gly, His, Phe, Thr, Tyr or Val; 
X,, is Arg, Leu, Pro or Ser; and 
X, 4 is Leu, Ser, Trp or Tyr; 

In particular, a stable binding loop having a h.gh affinity for CEA is disclosed, 
5 having the formula: Cys-X 4 -X s -X 6 -X r -X 1 -X,-X 10 -X 11 -Cys (SHQ ID NO: 3), 
wherein 

X 4 is Asn, Glu, or Met; 

X 5 is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, He, Lys, Phe or Thr; 
10 X 7 is Ala, Gin, Gly, Lys, or Thr; 

X s is Arg, Asn, Asp, Glu, or Gly; 

X, is Gin, Gly, or Leu; 

X 10 is Ala, Trp or Tyr; 

X„ is Ala, Gly, His, Phe, Thr, or Val; 
1 5 and wherein it is preferred that 

X 4 is Glu; 

X ; is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, He, Lys, Phe or Thr; 

X 7 is Lys; 

20 X g is Arg, Asn, Asp, Glu, or Gly; 
X q is Gin; 
X I0 is Trp; and 

X„ is Ala, Gly, His, Phe, Thr, or Val. 

Preferred polypeptides according to the invention comprise an amino acid 

is sequence: 

Asn-Trp-Val-Cys-Asn-Leu-Phe-Lys-Asn-Gln-Trp-Phe-Cys-Asn-Ser-Tyr (SEQ ID 
NO:4)(TN10/9-G08), 

Asp-Trp-Val-Cys-Glu-Asn-Lys-Lys-Asp-Gln-Trp-Thr-Cys-Asn-Lcu-Leu (SEQ ID 
NO:5)(TN10/9-A07), 
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Asn Tip- Asp -Cys-Met- Phe Gly- Ala- Glu-Gly- Trp- Ala Cys Ser Pro Trp (SEQ ID 
NO:6)(TN10/9-E01), 

Asp-Trp-Val-Cys-Glu-Lys-Thr-Thr-Gly-Gly -Tyr- Val Cys-Gln- Pro-Leu (SEQ ID 
NO:7)(TN10/9-B09), 

5 Asn-Trp-Phe-Cys-Glu-Met-Ile^Gly^Gln-Trp-Gly-Cys-Val-Pro -Ser (SEQ ID 
NO:8)(TN10/9-Fll),and 

Asp-Tr P -Val-Cys-Asn-Phe-As P -Gln-Gly^Leu-Ala-H 1 s-Cys-Phe^Pro-Ser ( SEQID 

NO:9)(TN10/9-D04). 

The most preferred CEA binding moieties according to the invention are isolated, 
i0 synthetic polypeptides having a high affinity for CEA. This invention provides a new 
class of CEA binding polypeptides having an amino acid sequence comprising: 

X I -X^X^^X^ Xr -X r X^X ir X 1I ^X ir X 1J -^ (SEQ ID NOT), 

wherein 

X 1 is Asp, Asn, Ala, or He, with Asp most preferred; 
15 X 2 isTrp; 

X 3 is Val, He, Met, Tyr, Phe, Pro, or Asp, with Val most preferred; 

X 4 is Asn, Glu, or Asp, with Asn and Glu most preferred; 

X is Leu Phe, Tyr, Trp, Val, Met, He, or Asn, with Leu most preferred; 

X 6 is Phe, Leu, Asp, Glu, Ala, He, Lys, Asn, Ser, Val, Trp, or Tyr, with Phe most 

20 preferred; 

X is Lys, Phe, Asp, Gly, Leu, Asn, or Trp, with Lys most preferred; 

X, is Asn, Pro, Phe, Gly, Asp, Ala, Ser, Glu, Ota. or Trp, with Asn most preferred; 

X, is Gin, or Lys, with Gin most preferred; 

X, 0 is Trp; 

25 X, , is Phe, Thr, Met, Ser, Ala, Asn, Val, His, lie, Pro, Trp, or Tyr, with Phe most 
preferred; 

x is Asn Asp, Glu. Pro, Gin, or Ser, with Asn and Asp most preferred; 

xl is Val, Leu, He, Pro, Ala, Gin, Ser, Met, Glu, Thr, Lys, or Trp, with Val and Leu most 

preferred; and 
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X, 4 is Leu, Met, Val, Tyr, Ala, He, Trp, His, Pro, Gin, Glu, Phe, Lys, or Arg, with Leu 
most preferred. 

The polypeptides listed in Table 5 (infra) are preferred embodiments of the 
present invention. Polypeptides 304A-12-H12 (SEQ ID NO:59), 304A-14-B02 (SEQ ID 
5 NO:74), 304A-14-A12 (SEQ ID NO:83), and 304A-15-E04 (SEQ ID NO:92) are 
especially preferred embodiments of the present invention. 

Another aspect of the present invention relates to modifications of the foregoing 
polypeptides to provide CEA specific imaging agents, wherein the binding moieties are 
modified by radiolabeling, enzymatic labeling, or labeling with MR paramagnetic 
,0 chelates; or wherein the binding moieties are incorporated in microparticles, ultrasound 
bubbles, microspheres, emulsions, or liposomes; or wherein the binding moieties are 

conjugated with optical dyes. 

In another aspect of the present invention, methods for isolating CEA binding 
moieties are provided. Such methods will be useful for isolating additional reagents for 
,5 detection, localization, quantification, and treatment of neoplastic disorders associated 
with upregulated CEA expression. 

In another aspect of the invention, methods of diagnosing CEA-associated 
disorders and methods for localizing CEA-expressing cells or tissues, are provided, and 
methods for treating cancers indicated by increased CEA expression are provided. 
20 In another aspect of the invention, therapeutic agents comprising a combination, 

conjugation or fusion of a anticancer drug or other therapeutic agent with a CEA binding 
moiety according to the invention are provided. Such compositions will be useful in the 
treatment of CEA-associated disorders and conditions. 

In another aspect of the invention, recombinant bacteriophage displaying CEA 
25 binding polypeptides on their surfaces arc also provided. Such phage are useful as 
screening reagents and reagents for detecting CEA. 

Another aspect of the invention relates to forming molecules containing multiple 
CEA-binding moieties to increase the residence time of these molecules on CEA targets. 
These multimeric molecules can be altered to provide CEA specific imaging agents by 
30 radiolabeling. enzymatic labeling, or labeling with MR paramagnetic chelates or 
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oparticles; ultrasound bubbles, microparticles, microspheres, emulsions, or 

liposomes; or optical dyes. 

Another aspect of the invention relates to introducing DNA that encodes one or 
more CEA-binding moieties into the coat protein of a v.rus to cause the vims to bind and 
preferentially infect CEA-bearing cells. Such alteration will make the virus target CEA- 

expressing (tumor) cells. 

These and other aspects of the present invention will become apparent with 

reference to the following detailed description. 

RRTFF DESCRI PTION OF THE DRAWINGS 

Figure 1 shows results of a competition ELIS A with CEA-binding phage isolated 
from the TN10/9 phage display library. The phage bearing CEA binding polypeptides are 
shown to compete for the same binding site (A3 domain) as the a-CEA(A3) chimeric 
antibody, cT84.66. 

Figure 2 shows ELISA scores for TN10/9 isolates A07 and G08, and Lib2 isolates 
304A-15-E04, 304A-12-H12, 304A-14-B02, and 304A-14-A12. In each group, the first 
bar is for wells each having 100 ng CEA. Successive bars are for 50 ng, 10 ng, 5 ng, 1 
ng, 0.5 ng, 0.1 ng and 0 CEA per well. 

Figure 3 shows ELISA scores for TN10/9 isolates A07 and G08, and Lib2 isolates 
304A-15-E04, 304A-12-H12, 304A-14-B02, and 304A-14-A12. The amount of CEA in 
each well is constant at 0.1 ug/well. A soluble peptide having the CEA binder sequence 
of the G08 isolate (SEQ ID NO: 4), DX-207, was added as an inhibitor to CEA binding. 
The first bar in each group represents 200 and successive bars represent 20 >iM, 2 
uM, 200 uM, 20 p.M, 2 uM, 0 M, and no target. 

DFFINITIONS 

In the following sections, the term "recombinant" is used to describe non-naturally 
altered or manipulated nucleic acids, host cells transfected with exogenous (non-native) 
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nucleic ac.ds. or polypept.des expressed non-na.urally. through manipulator, of isolated 
DNA and transformation of hos, ceils. Recombinant is a term tha, specfically encompasses 
DNA molecules which have been constructed ,„ v„™ using genetic engineering technics, 
and use of the tern, "recombinant" as an adjective to desenbe a molecule, construe, vector, 
cell, polypeptide or polynucleotide specifically excludes naturally occumng such 
molecules, constructs, vectors, cells, polypeptides or polynucleotides. 

The term "bactenophage" is defined as a bactenal vtrus containing a DNA core and 
a protective shell built up by the aggregation of a number of different protem molecules. 
The terms "bacteriophage" and "phage" are used herein interchangeably. 

TV term "bindin, moiety" as used herein refers to any molecule, polypept.de, 
pep„dom,me,,c or transformed cell ,",ransforma„f) capable of forming a b.nding complex 
with another molecule, polypeptide, pepfidomime.ic or cell. "CEA binding mo.e.y" ,s a 
binding morety that forms a complex with eareinoembryonie antigen (CEA) or a portton 
thereof whether naturally expressed or synthetic or recombinant, soluble or membrane 
bound included among the portions of CEA specifically contemplated are the N- 
terminal domain (N), and intact domains Al, Bl, A2, B2, A3, or B3, or combmattons of 
two or more such domains in a single conjugate or fusion protein. Particluar mentton „ 
made of the construct N-A3, which is a construe, ftising the N-terminal domain of CEA 
with domain A3 of CEA. The A3 domain does no. exhibit determinants having structural 
cognates in proteins known to be immunologically cross-reactive with CEA, and 
therefore binders to domain A3 may be capable of differentially binding to CEA and no, 
those known s,rue,urally related antigens, such as, especially, NCA. Specific examples 
of CEA binding moieties are the polypeptides mentioned above (SEQ ID NOs: 1-9, 24-27, 
and 36-107), hybrid polypeptides incorporating such polypeptides, and recombinant cells or 
bactenophage displaying any of such polypeptides. Also included whhin the definition of 
CEA binding moieties are polypeptides derived from a polypeptide having an am.no acd 
sequence accord.ng to SEQ ID NO: 1 , above, which have been modified for particular 
results (in addition to CEA or like polypeptide binding ability). Specific examples of 
modifications contemplated are C- or N-,erminal am.no acd substitutions or elongat.ons, 
„ e g for the purpose of l.nking the bind.ng moiety to a detectable imaging label or other 
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s ubstra,c. .„ add.tion to the detectable labels described funher herein, other suitable 
substrates include anticancer drugs or Cher chemotherapeutic agents, enzy.es, ,ox,ns, 
Kposomes (e.g., loaded with a detectable label or chemotheraeut.c agent), or a solid 
supp „ rt , well plare, bead, tube, s„de, filter, or dish. Also specifically contemplated arc 
su bs,i,„.,ons of one or more cysteine restdues that norma.ly form disuifide finks, for 
example substitution with non-na.una.ly occurring m ,no acd restdues having reactive s.de 
Cams for the purpose of forming a more stable bond between those amino acd positions 
than the former disulfide bond. All such modrlied CEA btnding moieties are also 
considered CEA binding moieties so long as they retain the ability to btnd CEA or a 

fragment or domain of CEA. 

The term "binding refers to the determination by standard techniques that a 
binding moiety recognizes and binds revers.b.y ,0 a given target. Such standard 
techniques mclude ecuihbrium dialysis, gel fil.ra.ron, and the monitonng of 
specoscopic changes tha, result from binding, e.g., us.ng fourescence aniso,ropy, Cher 
by direct binding measurements or competition assays with another bmde, 

The term "specificity" refers to a binding moiety having a higher btndtng affintty 
for one target over another. The term "CEA specific,./' refers .0 a CEA bindmg mote.y 
having a higher affinity for CEA as compared with another target, such as a serum protetn 
(e g bovine serum albumin (BSA), human serum albumin (HSA)) or gelattn. 

' The term "polypeptide" refers to a linear polymer of two or more ammo acd 
residues finked w„h amtde bonds, and the term "peptide" is used herem to refer ,o 
relatively short polypeptides, e.g., having fewer than about 30 amino acd, 

In the present application, a CEA binding moiety is said to "target" CEA- 
expressmg cell, if the binding moiety accumuia.es ,n or near .he CEA-expressing cells or 
, ,f the binding mo.ety is selectively taken up by .he CEA-express,ng ce,,s or if .he binding 
moie.y is selectively taken up by and metabolized by the CEA-expressing cells. 
Substances tha, are no, CEA binding mo.eties may be "targeted" to CEA-expressing cells 
by conjugation with CEA binding moieties of the present invention. 
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The term "cross-reactive" is used henin to describe binding associations between 
molecules ak,n ,o the binding of antibodies to antigens. .. , to be understood to refer ,o 
non-covalent binding, not to the formation of eovalent bonds. 

The term "de.ec.ably labeled" is to be understood as ineludtng linking a molecule 
,„ a dye (such as fluorescein), a radionuclide (such as »% an enzyme (such as 
horseradish peroxidase), or detectable metal (such as a paramagnetic ion), wh.ch dye, 
radionuclide, enzyme or meta, can .hereafter be detected by appropriate means. The term 
"detecably labe'ed" also inch*, a binding moiety which has been syntbestzed to 
incorporate a radionuclide (such as »P, '% or »C) in place of a non-radioacttve rsotope 
) of the same clement. 

pe^jnr^scKim:!! i nr nir rmFPPFP fmpqpiments 

The present mvention provides novel binding moieties for CEA. Such bindtng 
moieties make possible .he efficient detection, imaging, localization, and targetmg of 
5 CEA or CEA-related polypeptides in tissues or in a solution or system that contams CEA 
or CEA-related polypeptides. In particular, the binding moieties of thts invention, when 
appropriately labeled, are useful for detecting, imaging, localizing, and targeting CEA- 
expressing cells or for diagnosing CEA speerfre pathophysilogies. The CEA btnd.ng 
polypeptides disclosed herein can thus be used to form a vanety of dragnostie and 
2 » therapeutic agents for diagnosing and treating CEA associated drseases, such as colon 
cancer and other cancers characterized by ovcrexpression of CEA ,n cells, as compared 
with levels of CEA expression in corresponding cells of normal individuals. The 
preferred bindrng moieties of .he present invention bind CEA with high affinity, i.e., 
acting a, low, physiologically relevan. concentrations, comparable ,o known anti-CEA 
25 antibodies and other CEA-binding proteins. 

Preferred CEA binding polypeptides according to the invention will htnd to CEA 
or a fragmen. thereof, bu, will no. bind .0 other proteins .ha, are known to be 
immunologically cross-reactive with CEA, such as NCA. 

Speerf.c CEA binding polypeptides according to the present invention were 
isolated initially by screening of phage display libraries, that is, populations of 
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recombinant bacteriophage transformed to express an exogenous peptide loop on the.r 
surface In order to isolate new polypeptide binding mo,et,es for a particular target, such 
as CEA screemng of large peptide libraries, for example using phage display techniques, 
is especially advantageous, in that very large numbers (e.g., 5 x 1 <f) of potential binders 
can be tested and successful binders isolated in a short period of time. 

In order to prepare a phage library of potential binding polypeptides to screen for 
m embers of the library that might be CEA binding peptides, a candidate binding domain 
is selected to serve as a structural template for the peptides to be displayed in the library. 
The phage library is made up of analogues of this template or "parental domain". The 
binding domain template may be a naturally occurring or synthetic protein, or a region or 
dom ain of a protein. The binding domain template may be selected based on knowledge 
of a known interaction between the binding domain template and CEA, but this ,s not 
cntical In fact, it is not essential that the domain selected to act as a template for the 
library have any affinity for the target at all: Its purpose is to provide a structure from 
which a multiplicity (library) of similarly structured polypeptides (analogues) can be 
generated, which multiplicity of analogues will hopefully include one or more analogues 
that exhibit the desired binding properties (and any other properties screened for). 

In selecting the parental binding domain or template on which to base the 
variegated amino acid sequences of the library, the most important consideration is how 
the variegated peptide domains will be presented to the target, i.e., in what conformation 
the peptide analogues will come into contact with the target. In phage display 
methodologies, for example, the analogues will be generated by insertion of synthetic 
DNA encoding the analogues into phage, resulting in display of the analogue on the 
surfaces of the phage. Such libraries of phage, such as M13 phage, displaying a wide 
variety of different polypeptides, can be prepared using techniques as described, e.g., m 

Kay et al., PJia^eJ^is^^ (ACadCmiC 
Press, Inc., San Diego 1996) and U.S. 5,223,409 (Ladner et al.), both incorporated here.n 

by reference. 

The phage libraries used in the present invenfon are constructed in derivatives of 
, the filamentous phage Ml 3. The displayed pepttdes are fused ,0 the amtno terminus of 
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pro.cin HI mrough a linker pcpt.de wh.ch con,a,n S .he recognition sire for Factor X.. 
Facor Xa can cleave the displayed peptide from .he phage wuhout .njuring .he phage or 

reducing its infectivity. 

For fornrauon of phage drsplay libraries, i. is preferred .„ use a structured 
poly pep.ide as .he bind.ng domain template, as opposed ,o an uns.nrc.ured, ..near 
peptide. Mu.a.ion of surface res.dues in a pro.ein w,l. usua.ly have little effect on .he 
overa.1 structure or general properties (such as size, stab.li.y, and temperature of 
<ena.ura.ton) of the protem; white a, .he same - mu.ation of surface restdues may 
profoundly affect the binding properties of the prote,. The more ttghtly a polypept, e 
scg me„, ,s constrained, the .ess ..ke.y i. is .0 bind to any particular target; however 
p „ ly pep«ide <!oes bind, the bindtng ,s likely to be of higher affintty and of greater 
specify. Thus, i, ,s preferred to select a parental domatn and, in turn, a structure for 
the potential po.ypept.de btnders, that is constrained within a framework having some 
degree of rigidrty. In isolating the specific po.ypep.tdes accordtng to this inventton, four 
phage hbraries were screen, each displaying a short, variegated exogenous pepttde loop 
of , 1 12 or 16 amino acids on the surface of MB phage, a. the amino terminus of pro.e.n 
,„ The libraries were des.gna.ed TN6/6 (having a po.en.tal 3.3 x 10" amtno acd 
sequence diversity), TN7/I (having a potential 5.6 x ,0' amtno acd sequence d.vers.ry,, 
TN8/6 (having apotentia, 6.3 x ,0' amino acd sequence diversity,, and TN.0,9 (hav.ng 
, apotential3 x 10" amino acid sequence diversity). 

The TN6/6 library was cons.ructed .0 display a single microprotetn bindtng loop 
comained tn a ,2-amino acid template. The TN6/6 library utilized a template sequence of 
Xaa-Xaa^Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Cys^Xaa-Xaa (SEQ ID NO: .0). Each 
amino acid position Xaa in the template was varied ,o permi. any amino acd except 
.5 cysleine(Cys). The number of potential designed sequences .s 3.3 x 10'-; 2.0 -< 10 
independen. transforman.s were included in the library. 

The TN-. library was constructed to display a single micropro.ein b,nd.ng loop 
contained in an 1 .-amino acid temp.ate. The TN7/I library utilized a template sequence 
ofXaa-Xaa^Cys-Xaa-Xaa-Xaa^aa^aa^ys^aa^Xaa (SEQ ID NO; 11). The 
,„ am,n„ acds a. the firs, and last pos,.i„ns ,n ,he template (am.no acid positions . and , 
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were varied to permit any ammo acid selected from a group of seven amino acids (i.e.. 
Phe, His, Pro, Leu, Ala, Asp, or Arg); the amino acids at amino acid positions 2 and 10 in 
the template were varied to permit any am.no acid selected from a group of nine amino 
acids (i.e., Leu, Gly, His, Ser, Asp, Arg, Pro, Ala, or Phe); the amino acids at amino acid 
positions 4, 5, 6, 7, and 8 (i.e., between the invariant cysteine residues in the template) 
were vaned to permit any amino acid selected from a group of seventeen amino acids 
(i.e., Thr, lie, Trp, Glu, Tyr, Gin, Asn, Val, Leu, Gly, His, Ser, Asp, Arg, Pro, Ala, or 
Phe). The number of potential designed sequences is 5.6 x 10 9 ; about 1 .0 x 10" 
independent transformants were included in the library. 

The TN8/6 library was constructed to display a single microprotein binding loop 
contained in an 12-ammo acid template. The TN8/6 library utilized a template sequence 
ofXaa-Xaa-Cys-Xaa-Xaa^Xaa^Xaa Xaa-Cys-Xaa-Xaa (SEQ ID NO: 12). The 
amino acids at the first and last positions in the template (amino acid positions 1 and 12) 
were vaned to permit any amino acid selected from a group of four ammo acids (i.e., Ala, 
Asp, Arg, or His); the amino acids at amino acid positions 2 and 1 1 in the template were 
varied to permit any amino acid selected from a group of nine amino acids (i.e., Pro, Ala, 
Phe, Ser. Asp, Arg, Leu, Gly, or His); the amino acids at amino acid positions 4, 5, 6, 7, 8 
and 9 (i.e., between the invanant cysteine residues in the template) were varied to perm.t 
any amino acid selected from a group of thirteen amino acids (i.e., Pro, Ala, Phe, Ser, 
Asp, Arg, Leu, Gly, His, Gin, Asn, Val, or Trp). The number of potential designed 
sequences is 2.2 < 10 15 ; about 1 .0 x 10 9 independent transformants were included in the 
library. 

The TNI 0/9 library was constructed to display a single microprotein binding loop 
contained in an 16-ammo acid template. The TN10/9 library utilized a template sequence 
Xaa-Xaa-Xaa-Cys-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Cys Xaa-Xaa Xaa (SEQ 
ID NO: 1 3). The amino acids amino acid positions 1 , 2, 15 and 16 of the template were 
varied to permit any amino acid selected from a group of ten amino acids (i.e., Tyr, Arg, 
Ser, Trp, Leu, Asn, Pro, Asp, Phe, or His); the amino acids at amino acid positions 3 and 
14 In the template were varied to permit any amino acid selected from a group of fourteen 
, amino acids (i.e., Trp, Tyr, Arg, Ser, Val, Asn, Pro, Gin, Gly, His, Leu, Ala, Asp, or Phe); 
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the ammo acids a, amino acd pos,.,ons 5. 6, 7, 8. 9. .0, U and 12 (i.e., between the 
invariant cysteine residues tn the template, were vaned to permit any amino acd except 
cystetne. The number of potential designed sequences is 3.0 * 10'-, and about 2.5 . Iff 
independent transformants were included in the library. 

Such small binding loop peptides offer several advantages over large protetns: 
First the mass per binding site is reduced, e.g., such highly stable and low molecular 
weigh, polypeptide domains can show much higher binding per gram than do an.tbodtes 
(1 50 kDa) or single-chatn antibodies (30 kDa). Second, the possibility of non-spec, fic 
binding is reduced because there is less surface available. Third, small proteins or 
polypeptide can (because they are chemically synthesizable) be engineered to have 
unique tethenn* sites such as termina! po.y.ysine segments in a way tha, is impracticable 
for larger proteins or antibodies. Fourth.y, smal. peptides can be combined into homo- or 
hetero-multimers to give erther hybnd binding or avidity effects. Fifthly, a consumed 
polypeptide structure is more likely to retain Us functionality when transferred wtth the 
structural domain intact from one framework ,0 another, mat rs, the binding domatn 
structure is Irkely to be transferable from the framework used for presentation in a library 
(,£, , displayed on a phage) to an isolated protein removed from the presentatron 
framework or immobilized on a chromatographic or other substrate. 

Each of the peptide loop libraries was created by making a designed senes of 
mutations or variations within a coding sequence for the microprotetn template, each 
mutant sequence encoding a binding loop analogue peptide corresponding m overall 
structure ,0 the template except having one or more amino acid variations in the sequence 
of the template. The novel variegated (mutated) DNA provides sequence drversr.y, and 
each transforman, phage dtsplays one variant of the inrtial template amino acid sequence 
encoded by the DNA, leading to a phage population (library) displaying a vast number of 
different but structurally related amino acid sequences. The phage display libranes 
screened for CEA binders contained from 1 00 million to 1 billion variants of the 
respectrve parental domain peptides. The ammo acid variatrons are expected ,0 alter the 
hindtn, properties of the binding loop or domain without significantly altenng its 
„ structure, a, leas, for most subsmutions. 1, is preferred that the amino acid positions that 
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arc selected for variation (variable amino acid positions) will be surface amino acid 
pos.tions, that is, positions in the am.no acid sequence of the doma.ns which, when the 
domain is in its most stable conformation, appear on the outer surface of the doma.n (i.e., 
the surface exposed to solution). Most preferably the amino acid positions to be varied 
will be adjacent or close together, so as to maximize the effect of substitutions. 

As indicated previously, the techniques discussed in Kay et al., Phage Display of 
PT riH~ snH Protein- a t ,W a tnrv Manual (Academic Press, Inc., San Diego 1996) and 
U.S. 5,223,409 are particularly useful in preparing a library of potential binders 
corresponding to the selected parental template. The four libraries described above were 
prepared according to such techniques, and they was screened for CEA binding 
polypeptides against an immobilized CEA-relevant target (i.e., a synthetic fusion peptide 
target composed of a hexahistidine leader, the N domain of CEA and the A3 domain of 

CEA, designated "H6NA3")- 

In a typical screen, a phage library is contacted with and allowed to bind the 
target, in this case, CEA or a particular subcomponent(s), such as NA3, preferably 
presenting structures unique to CEA (i.e., structures not cross-reactive with NCA or other 
CEA-like antigens). The H6NA3 target was selected because the A3 domain was 
believed to be unique to CEA ; antibodies that bind A3 do not cross-react with other 
CEA-related antigens, such as NCA. 

To facilitate separation of binders and non-binders, it is convenient to immobilize 
the target on a solid support. When incubated in the presence of the target, phage beanng 
a target-binding moiety form a complex with the target on the solid support whereas non- 
binding phage remain in solution and may be washed away with buffer. Bound phage 
may then be liberated from the target by a number of means, such as changing the buffer 
to an extreme pH (pH 2 or pH 10), changing the ionic strength of the buffer, adding 
denaturants, or other known means. In the present case, CEA binders associating with 
immobilized target NA3 were eluted either by competition with a known CEA binding 
antibody (cT84.66, a chimeric mouse/human anti-A3 antibody supplied by The City of 
Hope, Duarte CA) or by cleavage with Factor Xa. 
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The recovered phage may then be amplified through infection of bacterial cells 
and the screening process repeated with the new pool that is now depleted in non-b.nders 
and enriched in binders. The recovery of even a few binding phage is sufficient to carry 
the process to completion. After a few rounds of selection, the gene sequences encoding 
s the binding moieties derived from selected phage clones in the binding pool are 

determined by conventional methods, described below, revealing the peptide sequence 
that imparts binding affinity of the phage to the target. When the selection process 
works the sequence diversity of the population falls with each round of selection until 
only good binders remain. The sequences converge on a small number of related binders, 
n typicallv ,0-50 out of the more than 100 million original candidates. An increase in the 
number of phage recovered at each round of selection, and of course, the recovery of 
closely related sequences are good indications that convergence of the library has 
occurred in a screen. After a set of binding polypeptides is identified, the sequence 
information may be used to design other secondary phage libraries, biased for members 
1 5 having additional desired properties. 

After analysis of the sequences isolated from the library screening, a family of 
particular CEA binders was defined. In addition, important consensus motifs were 
observed. The following sequences conforming to the TN10/9 template were found to 
bind a CEA target: 

20 Asn-Trp-Val^Cys-Asn-Leu-Phe^Lys^Asn-Gln-Trp-Phe-Cys-Asn-Ser-TyT (SEQ ID 
NO-4)(TN10/9-G08); 

Asp-Trp-Val-Cys-Glu-Asn-Lys^Lys^Asp-Gln-Trp-Thr-Cys-Asn-Leu-LeuCSEQID 
NO:5)(TN10/9-A07); 

Asn-Trp-Asp-Cys-Met-Phe-G^^ (SEQ ID 

^ NO:6)(TN10/9-E01); 

Asp^Trp Val^Cys-Glu-Lys-Thr-Thr-Gly-Gly-Tyr- Val^Cys-Gln-Pro-Leu (SEQ ID 

NO:7)(TN10/9-B09); 

Asn-Trp-Phe^Cys-Glu-Met-Ile-Gly-Arg-Gln-Trp-Gly-Cys-Val-Pro-Ser(SEQID 
NO:8)(TN10/9-Fll);and 
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Asp Trp Val -Cys-Asn- Phe Asp Gin- Gly-Leu- -Ala-His- Cys Phe-Pro- Ser(SEQ ID 

NO: ( ))(TN10/9-G01). 

When delayed on the phage, these peptides are expeeted to torn! a disulfide 
bond between the Cys residues. In the synthetic peptides, the cysteines are preferably 

s oxidized to form a disulfide. 

This series of CEA binders defines a family of polypeptides including the amino 

acid sequence: 

, ^ vvyyXXX-X -Cvs-X.--X.j-Xm (SEQIDNO.l), 

X 1 -X,-X 3 -Cys-X 4 -X s -X 6 -X 7 -A 8 -x 9 -A 10 a,, <~y=> a u , 3 , 4 . 

wherein: 

10 X, is Asn, Asp, or is absent; 

X ; is Trp; 

X, is Asp, Phe or Val; 
X 4 is Asn, Glu or Met; 
X 5 is Asn, Leu, Met or Phe; 
X„ is Asp, Gly, He, Lys, Phe or Thr; 
X 7 is Ala, Gin, Gly, Lys or Thr; 
X„ is Arg, Asn, Asp, Glu or Gly; 
X 9 is Gin, Gly, or Leu; 
X 10 is Ala, Trp or Tyr; 
X„ is Ala, Gly, His, Phe, Thr or Val; 
X 12 is Asn, Gin, Phe, Ser or Val; 
X 13 is Arg, Leu, Pro or Ser; and 
X 14 is Leu, Ser, Trp or Tyr; 
and wherein said polypeptide has the ability to bind CEA. 
2 s The cysteine residues of the microprotein are believed to form a disulfide bond, 

which causes the microprotein to form a stable loop structure under non-reducing 
conditions. Thus, the invention relates to the discovery of a CEA binding loop 
comprising a polypeptide having the amino acid sequence: Cys-X 4 -X 5 -X 6 -X 7 -X 8 -X,- 
X m -X„ Cys (SEQIDNO.3), 
30 wherein: 
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X 4 is Asn, Glu or Met; 
X, is Asn, Leu, Met or Phe; 
X„ is Asp, Gly, He, Lys, Phe or Thr; 
X 7 is Ala, Gin, Gly, Lys or Thr; 
5 X, is Arg, Asn, Asp, Glu or Gly; 

X 9 is Gin, Gly or Leu; 
X,„ is Ala, Trp or Tyr; and 
X,, is Ala, Gly, His, Phe, Thr or Val. 

Recurrent sequences among isolates recovered in screens and recurrence of certain 
0 an.no acids at the same positions within the isolate peptides gives rise to a preferred 
fam.ly of CEA binding peptides and a sequence that may be useful for designing a 
secondary, directed library for obtaining even h lg her affinity CEA binders. The preferred 
family of peptides have amino acid sequences of the formula: 
Xl -Trp-Val-Cys-Glu-X 5 -X 6 -Lys- X 8 -Gln-Trp- X n -Cys-Asn-X 13 -X 14 (SEQ ID 

15 N0:2), wherein 

X, is Asn or Asp; 

X, is Asn, Leu, Met or Phe; 

X 6 is Asp, Gly, He, Lys, Phe or Thr; 

X 8 is Arg, Asn, Asp, Glu, or Gly; 
20 X„ is Ala, Gly, His, Phe, Thr or Val; 

X 13 is Arg, Leu, Pro or Ser; and 

X u is Leu, Ser, Trp or Tyr. 

The invention also involves a library of phage focused on improved binders to 
CEA This library was constructed from seven sublibranes, each sublibrary allowing five 
2 , ammo acid positions to vary while holding constant nine of the amino acid positions that 
were varied in the initial library (TN10/9). The template structure of the seven 
sublibraries allowed variegation in the forms: 
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Varl : X r X : -X r Cys-X 4 -X r Lys-Lys-Asp-Gln-Tn,-Thr-Cys-Asn-Leu-Lcu (SEQ ID 
N0:14) 

Var2:Asp-T t p-Val-Cys-X 4 -X 5 -X () -X 7 -X s -Gln-Trp-Thr-Cys-Asn-Leu-Lcu(SEQlD 
N0:15) 

5 Var3: Asp-Trp-Val-C y s-Glu-Asn-Lys-X 7 -X 8 -X 9 -X 10 -X ir Cys-Asn-Lcu-Leu(SEQ ID 
NO: 16) 

Var4:Asp-Trp-Val-C y s-Glu-Asn-Lys-Lys-Asp-Gln-X 10 -X ir C y s-X, 2 -X 13 -X 14 (SEQID 
NO: 17) 

Var5: Asp-Trp-Val-C y s-Glu-X 5 -X 6 -L y s-X 8 -Gln-Trp-X n -Cys-Asn-X 13 -Leu (SEQ ID 
10 NO:18) 

, . Tm Va , rv , v v _X-Lvs-X.-Gln-Trp-X,,-Cys-Asn-Ser-Tvr(SEQlD 
NO:19) 

VarV: X r Trp-X 3 -C y s-Asn-Leu-Phe-L y s-Asn-Gln-Tr P -Phe-C y s-X 12 -X 1 3-X 14 (SEQ ID 
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NO:20), 

wherein each X residue was allowed to be, with approximately equal likelihood, each of 
the genetically encodable amino acids except cysteine. The invariant portions of 
sequences Varl through Var5 were based on the most frequently observed CEA binder 
sequence in the initial screening of TN10/9 (see, SEQ ID NO:5, isolate TN10/9-A07). 
The invariant portions of sequences Var6 through VarV were based on the binding 
peptide observed to have the best dissociation constant (see, SEQ ID NO:4, isolate 
TN10/9-G08). 

Additional, preferred CEA binding peptides were isolated from this focused 
library having the sequences as shown in Table 5 (infra). 

Direct synthesis of the peptides of the invention disclosed herein may be 
■omplished using conventional techn.ques including, preferably, solid-phase peptide 
synthesis, although solution-phase synthesis may also be used. In solid-phase synthesis, 
for example, the svnthesis is commenced from the carboxy-terminal end of the peptide 
using an a-amino protected amino acid. t-Butyloxycarbonyl (Boc) protective groups can 
be used for all amino groups, though other protective groups are suitable. See, Stewart et 
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al., Solid-Phase Pe ptide Synthesis (1989), W. H. Freeman Co., San Francisco; and 
McrrifickW. Am. Chem. Soc, 85:2149-2154 (1963). 

Polypeptides according to the invention may also be prepared commercially by- 
companies providing peptide synthesis as a service (e.g., BACHEM Bioscience, Inc., 
King of Prussia, PA; Quality Controlled Biochemicals, Inc., Hopkinton, MA). 

Automated peptide synthesis machines, such as manufactured by Perkin-Elmer 
Applied Biosystems, also are available. 

The polypeptide compound is preferably purified once it has been isolated or 
synthesized by either chemical or recombinant techniques. For purification purposes, 
there are many standard methods that may be employed including reversed-phase 
high-pressure liquid chromatography (HPLC) using an alkylated silica column such as 
C 4 -, C 8 - or C 18 -silica. A gradient mobile phase of increasing organic content is generally 
used to achieve purification, for example, acetonitrile in an aqueous buffer, usually 
containing a small amount of trifluoroacetic acid. Ion-exchange chromatography can also 
15 be used to separate peptides based on their charge. The degree of purity of the 

polypeptide may be determined by various methods, including identification of a major 
large peak on HPLC. A polypeptide that produces a single peak that is at least 95% of 
the input material on an HPLC column is preferred. Even more preferable is a 
polypeptide that produces a single peak that is at least 97%, at least 98%, at least 99% or 
20 even 99.5% of the input material on an HPLC column. 

In order to ensure that the peptide obtained using any of the techniques described 
above is the desired peptide for use in compositions of the present invention, analysis of 
the peptide composition may be carried out. Such composition analysis may be 
conducted using high resolution mass spectrometry to determine the molecular weight of 
25 the peptide. Alternatively, the amino acid content of the peptide can be confirmed by 
hydrolyzing the peptide in aqueous acid, and separating, identifying and quantifying the 
components of the mixture using HPLC, or an amino acid analyzer. Protein sequenators, 
which sequentially degrade the peptide and identify the amino acids in order, may also be 
used to determine definitely the sequence of the peptide. 
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The new class of CEA binding polypcpl.des is des.gned ic be conformational^ 
trained by d.su.fide Imkages between ibc ,wo eys.eine residues in ,he,r sequence. Th.s 
conformational resirain, ensures .ha, .he pep.ides have a siable bind.ng stiuc.ure .ha, 
c„„,r,bu,es ,o ,he peptides' affindy for CEA and the.r specific,,, for CEA over non-CEA 
proteins Other methods for eons,rainin g peptides which would re,ain a s.m.lar 
conformation and CEA specificity for the peptide have been described in the art and are 
contemned herein, ine.uding the substitution of one or more of the cysteine restdues 
widi non-naturaliy occurnng ammo ac.ds or peptidomimctics for ,he purpose of form.ng 
a more s.able or conforma.iona.ly preferred l.nkage between the .wo positions on the 
peptide All such modified CEA binding mo.eties are also considered CEA bmd.ng 
moieties so long as ,hey retain ,he ab.liiy .0 bind CEA or a portion thereof. Non- 
cyclized, or linear, versions of the peptides may also re,a,n modera.e binding ab.l.ty and 
specificity for CEA and could also be employed in the presen, invention. 

Homologues of the CEA binding polypeptides described here.n, as well as 
homology ,0 any subsequent d,scovered CEA binding polypep.ides, may be formed 
by substitution, add,,ion or de.efion of one or more amino acids employmg me,hods well 
known in the art and for particular purposes known in ,he art, such as add.fion of a 
po.yhist.dine »ta„» in order ,0 ass.s, in purifica.io„ or subs.i.u.ion of one up ,0 several 
amino acids in order ,0 obliterate an enzyme cleavage site. Omer specially 
contemplated homologues inch* polypeptides having N,erminal or C-termmal 
mod,f,ea,ions or linkers, sueb as polyglycine or polylysine segments, and alterations ,0 
mclude functional groups, noti*ly hydrazide (-NH-NH.) functionalities, ,0 ass.s, ,n 
immob.lization of binding peptides according ,0 this inven.ion on solid supports. 

Such homologous polypeptides will be understood ,0 fall w.thin ,he scope of the 
present invent.on so long as ,he subs.i.ution, addition or deletion ofam.no acids does no, 

eliminate its ability to bind CEA. 

The term "homologous", as used herein, refers ,0 the degree of sequence 
snrnlanty betiveen .wo polymers (i.e., polypeptide molecules or nucle.c ac.d molecules,. 
When the same nncleC.de or amino acid res.due occup.es a sequence position m .he ,wo 
„ polvmers under comparison, .hen ,he polymers are homologous a, ,ha, position. The 
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percent homology between two polymers is the mathematical relationship of the number 
of homologous positions shared by the two polymers d.v.ded by the total number of 
positions compared, the product multiplied by 100. For example, if the am.no acid 
residues at 60 of 100 amino acid positions in two polypeptide sequences match or are 
homologous then the two sequences are 60% homologous. The homology percentage 
figures referred to herein reflect the maximal homology possible between the two 
polymers, i.e., the percent homology when the two polymers are so aligned as to have the 
greatest number of matched (homologous) positions. Polypeptide homologues within the 
scope of the present invention will be at least 85% and preferably greater than 90% 
homologous to at least one of the CEA binding sequences disclosed herein. 

CEA binding polypeptides according to the present invention also may be 
produced using recombinant DNA techniques, utilizing nucleic acids (polynucleotides) 
encoding the polypeptides according to this invention and then expressing them 
recombinant i.e., by manipulating host cells by introduction of exogenous nucleic acid 
molecules in known ways to cause such host cells to produce the desired CEA binding 
polypeptides. Recombinant production of short peptides (e.g., 16-mers) such as those 
described herein may not be advantageous in comparison to direct synthesis, however 
recombinant means of production may be very advantageous where a CEA binding motif 
of this invention are desired to be incorporated in a hybnd polypeptide or fusion protein. 

The polynucleotides of the present invention may be in the form of RNA or in the 
form of DNA, which DNA includes cDNA and synthetic DNA. The DNA may be 
double-stranded or single-stranded, and if single stranded may be the coding strand or 
non-codmg (anti-sense) strand. The coding sequences for CEA binding polypeptides 
according to the present invention may be manipulated or varied in known ways to yield 
alternative coding sequences that, as a result of the redundancy or degeneracy of the 
genetic code, encode the same polypeptide. 

The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

Where recombinant production of CEA binding polypeptides is desired, the 
, present invention also contemplates vectors that include polynucleotides of the present 
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invention, host cells .l-a, arc genetically engineered wiih vectors of the ,nve„.,o,, and 
recombinant polypeptides produced by cul.urmg such generally engineered host cells. 
Host cells are genetically engineered (.ransduced or transformed or transfected, w,,h ,he 
vectors of this invention which may be, for example, a cloning vector or an expression 
vector The vector may be, for example, in the form of a plasmid, a viral pan.de, a 
phage etc The engineered host cells can be cultured in conventional nutrient med.a 
modified as appropriate for ac.ivat.ng promoters, selecting transforman.s or amplifying 
the CEA binder-encodmg polynucleot.de, The culture conditions, such as temperature, 
pH and the like, are those suitable for use with the host eel. selected for expression and 
w,ll be apparent to the skilled practitioner in this field. The polynucleotide may be 
included m anv one of a variety of expression vectors for expressing a polypeptide. Such 
vectors include chromosomal, nonehromosomal and synthet.e DNA sequences, e.g., 
denvatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeas, plasmtds; 
vectors derived from combinations of plasmids and phage DNA, viral DNA such as 
vaccima, adenovirus, fowl pox virus, and pseudorabies. However, any other vector may 
be used as long as it is replicable and viable in the host. The appropriate DNA sequence 
ma y be inserted into the vector by a variety of procedures. In genera,, the DNA sequence 
is inserted into an appropriate restriction endonuclease site(s) by procedures known ,n the 
art. Such procedures and others are within the capab.lity of those skilled ,n the art. 

The DNA sequence in the expression vector is operatively finked to an 
appropriate expression control sequenec(s) (promoter) to direct mRNA synthests. As 
representative examples of such promoters, there may be mentioned LTR or SV40 
promoter, the E. coli. lac or trp, the phage lambda P L promoter and other promoters 
known to control expression of genes in prokaryo.ic or eukaryotic cells or ,he,r v.ruses. 
The expression vector also contains a ribosome binding s.te for translation initiation and a 
transcription terminator. The vector may also include appropriate sequences for 
amplifying expression. In addition, expression vectors preferably will conta.n one or 
more selectable marker genes to provide a phenotypie trait for selection of transformed 
host cells, such as dihydrofolate reductase or neomycin resistance for eukaryot.c cell 
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The vector containing the appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be emp.oyed to transform an 
appropriate host to permit the host to express the protein. As representative examples of 
appropriate host cells, there may be mentioned bactenal cells, such as E. coh, 
Streptomvces, Salmonella typhimunum; fungal cells, such as yeast; insect cells such as 
Drosophila and Sf9; animal cells such as CHO, COS or Bowes melanoma; plant cells, 
etc The selection of an appropriate host for this type of CEA binder production is also 
, vitW n the capability of those skilled in the art from the teachings herein. Many suitable 
vectors and promoters useful in expression of proteins according to this invention are 
known to those of skill m the art, and many are commercially available. The following 
vectors are provided by way of example. Bactenal: P QE70, pQE60, pQE-9 (Qiagen), 
pBS( + or -), pDIO, Prescript, ps,X174, pBluescnpt SK, pBSKS, pNH8A, P NH16a, 
pNHl 8A, PNH46A (Stratagene); ptrc99a, P KK223-3, P KK233-3, P DR540, P RIT5 
(Pharmacia). Eukaryotic: pWLNEO, P SV2CAT, pOG44, pXTl, pSG (Stratagene) 
pSVK3, pBPV, pMSG, pSVL (Pharmacia). Any other plasmid or vector may be used as 
long as it is replicable and viable in the selected host cell. 

Introduction of the vectors into the host cell can be effected by any known 
method, including calcium phosphate transfection, DEAE-Dextran mediated transfection, 
or electroporation (see Davis et al., BasicMg thods in Mnlm.lpr Biology (1986); 
Sambrook et al., MoMaLOmimg, ISBN 0-87969-309-6, (1987)). 

In the practice of the present invention, a determination of the affinity of the CEA 
binding moiety for CEA relative to other components of a sample is a useful measure, 
and is referred to as specificity for CEA. Standard assays for quantitatmg binding and 
determining affinity include equilibrium dialysis, equilibrium binding, gel filtration, 
surface plasmon resonance, m.crobalances (Hengerer et al., Biotechnigues, 26(5):956-60, 
962 964 (1999)) or the monitoring of numerous spectroscopic changes (such as 
fluorescence) that may result from the interaction of the binding moiety and its target. 
, These techniques measure the concentration of bound and free hgand as a function of 
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ligand (or protein) concentration. The concentration of bound polypept.de ([Bound]) is 
related to the concentration of free polypept.de ([Free]) and the concentration of b.nd.ng 
sites for the polypeptide, i.e., on CEA, (N = total CEA), as described in the following 
equation: 

5 [Bound] = N x |Free]/((K D )+[Freel). 

A solution of the data to this equation yields the dissociation constant, K D , a 
quantitative measure of the binding affinity. The association constant, K a is the 
reciprocal of the dissociation constant, K D . A peptide having a K D 2 times higher for HSA 
(or some other non-CEA target such as NCA) than for CEA would be considered as a 
o weak CEA binder. A peptide having a K D 10 times greater for HSA than CEA would be a 
moderate CEA binder, and a peptide having a K D 100 times or more greater for HSA than 
for CEA would be termed highly specific for CEA. Preferably the peptides and agents of 
the present invention have a K D at least 2 times higher for HSA than for CEA, more 
preferably at least 10 times higher, even more preferably at least 100 times, and most 
1 5 preferably at least 1 000 times higher. 

For most uses, the lower the dissociation constant, the better. Preferred CEA 
binders according to the invention will have a K d for CEA of less than 1 0 nM, more 
preferred CEA binders will have a K d for CEA of less than 1 nM, and most preferred 
CEA binders will have a K d for CEA less than 0.1 [iM or lower. The first set of CEA 
20 bmders isolated from the TN10/9 library had a K d for CEA in the range of 1 uM to 7 jiM. 

t Tc^c far CPA Rinding Polypeptides 

The CEA binding moieties according to this invention will be useful for detecting 
the presence of CEA in blood or other biological fluids and/or for localizing or imaging 
25 of CEA expression in vitro or in vivo, and particularly for detection and/or imaging of 
CEA-expressing cells and tissues. Any suitable method of assaying or imaging CEA may- 
be employed. 
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DctectimLQf£EA 

Assays for CEA using the CEA binding moieties of the present invention can be 
d.rec, binding, competitive binding, or sandwich assays. The CEA binding m„,e,y can be 
attached to a surface and used in a surface-plasmon resonance or mierobalanee to detect 

the binding of CEA directly. 

Alternatively, bacteriophage that display a CEA binding peptide can be incubated 
with cells to be tested for CEA expression. The cells can be spun down ,n a manner that 
does no, sedtmen, the phage, and the presence of phage in the eel. pellet can then be 
detected with labeled antibodies that bind to the phage. Another useful detection assay 
, utilizes toectf.lv labeled CEA binding moiety, which is mixed with cells to be tested for 
surface expression of CEA. After incubation, the cells ate spun down and the presence 
(or absence) of CEA is detected by the presence (or absence, of the label m the cell pellet. 

In a further detection method, a CEA binding moiety can be immobilized, a 
sample to be tested is contacted with the tmmobiltzed CEA btnding moiety, and after 
,5 incubation, the sample is removed and the container washed. The presence of CEA ,s 
detected with a deteetab.y labeled antibody that binds CEA. This antibody need no. 
distinguish between CEA and other cross-reactive antigens where preferred polypeptides 
according to this invention are used, because only CEA will have been captured by the 
immobilized bindtng mo.ety (which, in preferred features, ,s specific for CEA and not for 
20 cross-reactive species such as NC A). 

In another method, a CEA binding mo.ety of the present invention is immobilized 
in a well; detectably labeled CEA is bound to the CEA binding moiety; a sample » then 

u o ArTPA ic detected bv the release of the labeled CEA 
added, and the presence or absence of CEA is detected oy inc 

or the retention of the labeled CEA. 

For detection or purification of CEA or CEA-cxpressing cells in or from a 
solution, a binding moiety of the invention can be immobilized on a solid substrate such 
as a chromatographic support or other porous materia,, then the immobilized b.ndtng 
moiety can be loaded or contacted with the solution under conditions suitable for 
formation of a binding moiety/CEA complex. The non-binding portion of the solution 
„ can be removed and the complex may be detected, e.g., using an an.t-CEA or an,,-b,nd,ng 
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moiety antibody, or the CEA target may be released from the binding moiety at 
appropriate clution conditions. 

Tumor Imaging 

A particularly preferred use for the polypeptides according to the present 
invention is for creating visually readable images of tumors including neoplastic cells 
expressing high levels of CEA, to aid in the diagnosis, monitoring and treatment of CEA 
associated cancers or other disorders. 

The CEA binding moieties disclosed herein may be converted to imaging reagents 
for detecting CEA-expressing tumors by conjugating the polypeptides with a label 
appropriate for diagnostic detection. Preferably, a CEA binding polypeptide exhibiting 
much greater specificity for CEA than for NCA is used. A polypeptide according to this 
invention may be conjugated or linked to a label appropriate for the detection 
methodology to be employed. For example, the CEA binder may be conjugated with a 
paramagnetic chelate suitable for magnetic resonance imaging (MRI), with a radiolabel 
suitable for x-ray imaging, with an ultrasound microsphere or liposome suitable for 
ultrasound detection, or with an optical imaging dye. 

Suitable linkers for conjugating the polypeptide binder to a detectable label can be 
substituted or unsubstituted alkyl chains, amino acid chains (e.g., polyglycine), 
polyethylene glycols, polyamides, and other simple polymeric linkers known in the art. 
Many heterobifunctional linkers are also known and are commercially available. 
Detectable labels may also be bound directly to the CEA binding moieties, e.g., at a 
lysine side chain or other reactive site that does not interfere with CEA/binding moiety 
interaction. 

Molecules that contain multiple copies of a CEA-binding moiety are likely to 
have longer residence times both at the tumor and in circulation. It is desirable to have an 
agent that is intended to bind CEA for either imaging or therapy to remain in circulation 
for at least a few hours so that it has time to reach the tumor. Once the agent has reached 
the tumor, it is desirable that it stay there as long as possible. 
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Many technics are known for preparing mullimeric forms of binding molecules, 
and such techniques may be used ,o prepared CEA binding mul.imers having increased 
scrum half life and higher avidily for CEA. For example, one of the CEA binding 
peptides of the present mven.ion can be synthesized with a Cerminal extention of Gly- 
Gly-Lys The side groups of the all other residues are protected in one way and the 
terminal Lys side group and carboxyl groups are protected in an orthogonal manner. The 
terminal Lys amine group is deprotec.ed and a chelator group is attached. The carboxyl 
groups is deprotected and two copies are joined using bifunc.iona. polyethylene glycol 
reagents. The other protecting groups are removed. Just before use, a suttable 
radionuclide is added. For imaging »"Tc is a preferred radionuclide and HYNIC ,s a 
preferred chelator. Alternately, moieties other than chelators could be coupled to the 
lysine extentton. Whole an.ibodtes show detectable avidity effects when bindtng to CEA. 
Thus comparable avidity effects are expected in a dtmerized binding peptide aecordmg to 
the invention, if the peptide moieties of one molecule can be separated by a dtstance 
srmtlar to the separation between the combining sites of an antibody (i.e., on the order of 
,00 A. A linker containing -80 units of -(CH.-CH.-O)- should allow sufficient 
separation and increase the serum residence time. 

,n general, the technique of using a detectably labeled CEA binding moiety in 
v/vo for diagnosis is based on the premise that the label generates a signal that is 
detectable outside the patient's body. When the detectably labeled CEA binding motety 
is admtnistered to the pattent suspected of having a CEA-expressing tumor, the htgh 
affinity of the CEA binding motety for CEA on a tumor causes the CEA binding motety 
,„ bind to the tumor and accumulate label at the site of the tumor. Sufftcten, fme ,s 
allowed for the labeled peptide to localize a, the site of the tumor. The signal generated 
by the labeled peptide may then be detected by a scanning device which w,ll vary 
according to the type of label used, and the signal is then converted ,„ an tmage of the 
tumor. 

Therapeutic Applications 

The CEA binding moieties of the present invention can be used to improve the 
, activity of anti-cancer drugs or tumor-killing agents by providing or improving their 
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affinity for CEA. In this aspect of the invention, hybrid anti-tumor agents are provided 
by conjugating a CEA binding moiety according to the invention with a drug or other 
agent lethal to the tumor. The CEA binding mo.ety component of the conjugate causes 
the anti-tumor agent to target the sites of CEA-express,ng cells, and to improve the 
s affinity of the conjugate for the cells, so that the anti-tumor activity of the conjugate is 
more localized and concentrated at the sites of tumors. Such conjugates will be useful m 
treating CEA associated diseases, especially colon cancer, in humans and animals, which 
method comprises administering to a human or animal in need thereof an effective 
amount of a CEA binding moiety according to the invention conjugated with an 
o appropriate therapeutic agent. The invention also provides the use of such conjugates in 
the manufacture of a medicament for the treatment of diseases associated with the 
overexpression of CEA by cells in humans and animals. 

A CEA binding moiety of the present invention may be advantageously used to 
target a toxin, radioactivity, cytolytic T cells, cytokines, chemotherapeutic agents or other 
15 molecules to a tumor expressing CEA. 

In the above treatment method, the compounds may be administered by any 
convenient route customary for anti-tumor treatments, for example by infusion or bolus 
injection. In a preferred embodiment, the composition may be formulated in accordance 
with routine procedures as a pharmaceutical composition adapted for intravenous 
20 administration to human beings. Typically, compositions for intravenous administration 
are solutions in sterile isotonic aqueous buffer. Where necessary, the composition may 
also include a solubilizing agent and a local anaesthetic such as lignocaine to ease pain at 
the site of the injection. Generally, the ingredients will be supplied either separately or 
mixed together in unit dosage form, for example, as a dry lyophilised powder or water 
,s free concentrate in a hermetically sealed container such as an ampoule or sachette 

indicating the quantity of active agent in activity units. Where the composition is to be 
administered by infusion, it can be dispensed with an infusion bottle containing sterile 
pharmaceutical grade "water for injection" or saline. Where the composition is to be 
administered by injection, an ampoule of sterile water for injection or saline may be 
30 provided so that the ingredients may be mixed prior to administration. 
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The quantity of material administered will depend on the seriousness of the 
condition and position and size of the tumor. The precise dose to be employed and mode 
of administration must per force in view of the nature of the complaint be deeded 
according to the circumstances by the physician supervising treatment. In general, 
dosages of the CEA binder/anti-tumor agent conjugate will follow the dosages that a,e 
routine for the anti-tumor agent alone, although the improved affinity for CEA added by 

the CEA binder component may allow a decrease in the standard dosage. 

Isolation of CEA binding moieties in accordance with this invention will be 

further illustrated in the following examples. The specific parameters included in the 

following examples are intended to illustrate the practice of the invention, and they are 

not presented to in any way limit the scope of the invention. 

Fvam ple 1 : Proration of a C EA Target for Library Screening 

For screening libraries to isolate binding moieties for CEA, a truncated target 
protein, i.e., H6NA3, consisting of a hexahistidine leader and the N and A3 domains of 
CEA, was used, based on a presumption that binders directed at the A3 domain would not 
be cross-reactive with other antigens, such as NCA, having a high degree of homology to 
CEA. The recombinantly produced H6NA3 protein was dispersed in PBS and added to 
the wells of a 96-well polystyrene microliter plate, at 1 ng/well. The plate was allowed to 
stand overnight at 4°C, which was effective to immobilize target H6NA3 antigen on the 
plate. 

Fv.m ple 2: S ^nin ff of Phaap Display Libraries 

Four phage display libraries were used in the initial screening for CEA binding 
moieties. The libraries were designated TN6/6, TN7/T, TN8/6 and TN10/9. 

The TN6/6 phage display library was composed of recombinant Ml 3 phage 
displaying variegated exogenous single-loop peptides based on a microprotein template 
having the structure Xaa^Xaa-Xaa^Cys-Xaa-Xaa-Xaa^Xaa-Cys-Xaa-Xaa-Xaa (SEQ 
ID NO: 10) and providing 2.0 x 10 8 peptide diversity. 
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The TN7/1 phage chsplay library was composed of recombinant M13 phage 
Slaving var.cga.ed exogenous s,ngle-l„o P peptides based on a m.croprotein template 
hav.ng the structure Xaa-Xaa-Cys Xaa Xaa-Xaa-Xaa -Xaa Cys-Xaa Xaa (SEQ ID 
NO 1 1 ) and providing 1 r. 1 0" peptide diversity. 

The TN8/6 phage display library was composed of recombman, Ml 3 phage 
displaying vanegated exogenous smgle-loop peptides based on a micropro.ein template 
hav.ng the structure Xaa-Xaa^Xaa-Xaa-Xaa-Xaa-Xaa^Xaa-Cys-Xaa-Xaa (SEQ 
ID NO 12) and providing about 1 .0 x 10° peptide diversity 

The TN10/9 phage display library was composed of recombinant M13 phage 
, displaying variegated exogenous single-loop peptides based on a micropro.ein template 
hav.ng ,he structure Xaa-Xaa-Xaa-Cys Xaa-Xaa- Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Cys- 
X aa-Xaa-Xaa(SEQ.DNO: 13) and providing 2.5 * 10' peptide diversity. 

All of the libraries were constructed so that the phage expressed a vanegated 
pept.de a. the amtno termtnus of protein Hl, and a constant Factor Xa cleavage she was 
„ provided between the diplay peptide and mature protein .11. Each library was separately 
diluted tnto 100 pL of binding buffer (50 mM Tns, 1 50 mM NaCl. 2 mM CaCl„ 0.05% 
Tween-20) before addition to H6NA3-coated wells. 

Each library was allowed to interact with the target separately. After a 2-hour 
incubation with the target to allow binding, the wells of the plate were washed 
20 extensively (1 5 times) to remove unbound or weakly bound phage. Bound phage were 
recovered by eluting the phage by two methods: ft* a competing ligand for CEA, 
namely a chimenc mouse/human an.i-A3 monoclonal antibody, cT84.66 (added at 333 
„M), and second, by Factor Xa cleavage. The eluted phage were recovered and 

propagated overnight. 

The amplifted phage recovered were concentrated, re-exposed to the target and 
e,u,ed with either cT84.66 antibody or Factor Xa. and this procedure was repeated two 
more ttmcs. A progress .ncrease in the elution titer following each of the four rounds 
of screening tnd.cated selection of phage having aff.n.ty for the NA3 target. 
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Fv^m piP V Analysis of Ind ividual Isolates 

After four rounds of select.on, the eluted phage were propagated and a port.on 
p ,ated to isolate phage plaques arising fiom individual clones. Ninety-four such clones 
were selected randomly, propagated, and tested individually for binding to N A3 in a dned 
H6NA3 plate EL1SA. Dried H6NA3 plates were prepared as described above for the 
library screening. Phage samples (-10' phage each) were incubated in the H6NA3 plate 
wells in binding buffer (50 mM Trts, 1 50 mM NaCl, 2 mM CaCl 2 , 0.05% Tween-20) 
containing 0.1% HSA. After 1 hour, the plates were washed 5 times with btndmg buffer. 
Polyclonal anti-M13 antibody conjugated to horseradish peroxidase (Pharmacta) was 
added at 1/5000 dilution in binding buffer to the wells and incubated for 1 hour. The 
wells were a.ain washed 5 times with binding buffer and the presence of the 
antibody/phage/NA3 complex was measured with HRP calonmetric reagents (3,3',5,5'- 
tetramethylbenzidine (TMB) and H A). A high absorbance at 630 nm (due to oxidized 
1MB) was indicative of a tight phage/N A3 mteractton, and phage clones corresponding 
to those wells were identified as bearing CEA-binding moieties. 

ELISAs to assay binding to immobilized HSA (passively bound to the 
polystyrene plate) and a target-free microtiter plate were controls to eliminate phage that 
bound promiscuously or nonspecifically. 

The amino actd sequences of the phage-displayed polypeptides from the ELISA 
positive clones (those positive for NA3 but negative for HSA and the polystyrene plate) 
were deduced by DNA sequencing. The ammo acid sequence data from these phage 
isolates were sorted according to the degree of similarity and response in the H6NA3 
plate ELISA. The results of the screen from the TN10/9 library are set forth in Table 1 . 
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Table 1 : 



Amino acid sequences oi 



TEA-b,nd,ng polypeptides from the TNI 0/9 library 



TNI 0/9 
isolate 



10 



15 



G08 



sequence 



AO 7 



EOl 



B09 



NWVCNLFKNQWFCNSY 



DWVCENKKDQWTCNLL 



NWDCMFGAEGWACSPW 
DWVCELTTGGYVCQPL 




NWFCEMIGRQWGCVPS 
DWVCNFDQGLAHCFPS" 



competition 
elution (AbE) 



cleavage 
elution (FXE) 



0/46 (0.00) 



42/46 (0.91) 



2/46 (0.04) 



1/46 (0.022) 



NWRCKLFPRYPYCSSW 
rRYCEFFPWSLHCUKf 



0/46 (0.00) 



0/46 (0.00) 



6/48 (0.125) 
33/48 (0.69) 



0/48 (0.00) 



0/48 (0.00) 



overall 
fraction (94) 



6/94 (0.064) 



75/94 (0.80) 



2/94 (0.043) 



SEQ 
ID NO. 



4/48 (0.083) 



0/46 (0.00) 



1/46 (0.022) 



1/48 (0.021) 



1/48 (0.021) 



3/48 (0.063) 



1/94 (0.011) 



4/94 (0.043) 



1/94(0.011) 



1/94(0.011) 



4/94 (0.043) 



9 
21 



22 
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The screens of the TN6/6, TN7/1 and TN8/6 libraries did not result in recovery of 
any high affinity CEA binders. Peptides GOl (SEQ ID NO:21) and B10 (SEQ ID NO:22) 
were later found not to bind CEA with useful affinity. 

The polypeptide sequences binding to the H6NA3 target define a cysteme- 
bracketed CEA binding loop often amino acids (including the cysteines), viz., Cys-X 4 - 
X r X 6 -X r -X 8 -X r -X,„-X II -Cys (SEQ ID NO: 3), wherein 
X 4 is Asn, Glu or Met; 
X 5 is Asn, Leu, Met or Phe; 
X 6 is Asp, Gly, He, Lys, Phe orThr; 
X 7 is Ala, Gin, Gly, Lys or Thr; 
X 8 is Arg, Asn, Asp, Glu or Gly; 
X, is Gin, Gly, Leu or Ser; 
X 10 is Ala, Trp or Tyr; and 
X n isAla, Gly, His, Phe, ThrorVal, 
which forms a stable binding site for CEA. 

It is also clear from the selected isolates that one particular sequence, that of AO / 
(SEQ ID NO:5), recurs with high frequency (75/94) and was recovered by both elation 
me thods- competition with cT84.66 (AbE) and Eactor Xa cleavage (FXE). Because th.s 
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p „, y „e P ,,de occurred wHh such h.gh frequency among the .solates. i« was regarded as a 
prcrcrcn „ a , binder, and the other sciences were compared agatnst the A07 science to 
define whether any anr.no acd pos.t.ons m the . 6- m er were conserved. A posmon 
was cons,dered conserved ,f one of the other tsolates expired the same ammo ac, a, he 
same posihon rela,ive to the invanan, cysteine residues; a posttion was considered tugh.y 
conserved if .wo or more .f the other isolates exhibited the same amino acid a. the same 
position relattve to the mvarian, cysteine residues. From thts analysis, the followmg 
conserved sequence (SEQ ID NO: 23) was observed: 

AA position: , 2 3 * > « ' 8 » » » « " " " ^ 

conserved: 

,„ this conserved sequence (SEQ ID NO: 23,, the positions designated »X» showed no 
conservation of an A07 ammo acid, the specrfted amino acid restdues <a, posittons 8, 14 
and ,6) were conserved, and the underscored amino acd resrdues were highly conserved. 
The conserved sequence was used as a parenta. tenrp.ate in the destgn of an addittona,, 
secondary library (Lib2) to be screened for additional high affinity binders of CEA. 

F . rmr l. s- Fpitnne Mmnim- "f Pnare Isolate 

Severa. of the phage isolates were tested to determine whether binding occurred ,n 
,he CEA A3 domain, by performing an ELISA with different concentrations of the an,,- 

,„ each assay, H6NA3 (.00 ng'well) was coated on micro.iter plate wells, b.ocked w„h 
2% non-fat dry milk (Carnation), and peptide displaying phage were added (5 < 
,0-Vwen, m the presence or absence of cT 8 4.66. After washmg, binding to H6NA3 was 
detected using horseradish peroxidase-labeled a„,i-M, 3 antibody. Where cTM.66 was 
used it was added a, lOnMo, lOOnM in separate trials. Addition of BSA was 
performed as a control; and assays usmg non-CEA-binding phage B,0 (SEQ ,D NO:22, 
and GO, (SEQ ID NO:21) were also perfonned as negative controls. The resuHs of the 
mapping ELISA are shown in Fig. I. These resu.ts show that in each case, the previously 
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lso ,atcd CEA binding phage competed for the same binding site with the antibody 
CT84.66, which recognizes the A3 domain of CEA. 



Fvam ple 6: Bindin iLSludies 

The affinity of the peptides displayed by the four highest affinity phage isolates 
(Fig Dwerefurthertestedindirectbindingstud.es. 27-mer peptides including the 
binding loops of the G08, A07, E01 and B09 isolates were synthesized by sohd-phase 
synthesis. The peptides thus prepared are shown in Table 2. 
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Table 2: 



Amino acid sequences of CEA-bmdmg polypeptides for binding studies 
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Peptide 



Synthesized Polypeptide 



1 5 0 5 o => 

p7G08 ^MWTNT^KN ^CNSYAPGGKGGU^K-LUMH, 

-p^OT QnwvrKNKKD OWTCNLLAPGGKGGGSK-lu NH^ 

■zz-^r, 1 snwdcMFGAK GWACSPWAPGGEGGGSK-CUNH, 

gnwVCELTTGG^ VCQPLAPGGKGGGSK-CONH, 



SEQ 
ID NO: 



24 



25 



26 



27 



As seen from Table 2, each of the peptides had an added N-termmal serine 
residue, replicating part of the context of the phage-display peptides. Each peptide was 
also provided with a C-termmal amrde-functional linker useful for immobilizing peptides 
t0 various chromatographic substrates: -Ala 18 -Pro 19 -Gly, 0 -Gly-Glu-Gl y -Gly^Ser^ 
Lys-CONH, (SEQ ID NO:28). The tnpeptide -Ala 18 -Pro 19 -Gly 20 - replicates part of the 
context of the phage-displayed peptides, and the remainder of the C-terminus is a 
synthetic linker for immobilization. A.iquots of each peptide were fluorescently labeled 
us,ng NHS-fluorescein reacted with the s-amino side group of the C-terminal lysine. 

Dissociation constants were determined using fluorescence anisotropy, through 
dir ect binding measurements and competition experiments. In direct binding assays, the 
concentration of the fluorescein-labeled peptide is held constant and the concentration of 
H6NA3 is varied. In the competition experiment, the concentration of the fluorescem- 
iabeled peptide and the H6NA3 target are held constant and the concentration of a 
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ion 



competitor (CT84.66. u„,abe,ed, ,s varied. The change in anisotropy ,s fi. to « c 
^ equation vi. nonlinear regression ,c obtain *e apparent K, The d.ssoc.a 



Table 3. 
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Table 3: 



Dissociation 



^^^^ 



Polypeptide 



P-G08 



Amino Acid Sequence 



P-A07 



SEQ ID NO: 24 



direct binding 



SEQ ID NO: 25 



1.9 \iM 



competition 
(not done) 



5.9 |iM 



P-E01 SEQ ID NO: 26 

]Tgo<j SEQ ID NO: 27 



6.9 nM 



6.0 [iM 



3.6 



5.3 |aM 



1.0 nM 
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These experiments sho. .ha, the peptides bind CEA domain A3 with dissociation 
const a„, ranging from 1 to 7 uM. From these tests, the polypeptide containing the COS 
sequence appears to be the highest affinity binder isolated. 

^^j^MmJas^m m^^ <T A hintin g 

A second TNI 0 library, focused on improved binding peptides for CEA, was 

constructed, using sequence and binding information obtained in the previous exam P ,es. 
From the prevalence of sequences, the polypeptide A07 (SEQ ID NO:5) appeared 

t0 be the bes, binder. Using this sequence as a secondary parenta, domain or template, 5 

amino acd positions »„hin the A07 sequence were vanegated. Five ohgonudcot.des 
instructed that used A07 as the parenta, sequence and ahowed five positions a, a 
^ through a., sequences tha, exclude cysteine. The oligonucleotide sequences 

4 below After observing that the peptide G0 8 had the .owes, K„ additional vanegated 
o„ g „nuc.co,ides coding for peptides based on the paren.a! sequence of G08 were 
dcsig „ed and added ,„ ,he focused library. The oligonucleotide sequences ,ha, encoded 
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G08-based peptides 
below. 



having the designed sequences Var6 and Var7 shown in Table 4, 



Table 4: 



D^i^dTol^e^ "Lib2" 



Encoded Peptides 


Amino Arid Seauence 


SEQ ID NO: 


A07 (parental) 


DW VLbJN JxrJJW" 1 ^ iNiJ - JJ - J 


5 


Varl 


XXXCXXKKDQWTCN^L 


29 


Var2 


DWVCXXXXXQWTCNLL 


30 


Var3 


DWVCENKXXXXXCNLL 


31 

h 32 


Var4 


DWVCENKKDQXXCXXX 




Var5 


DWVCEXXKXQWXCNXL 


33 


G08 (parental) 


NWVCNLFKNQWFCNSY 


4 


Var6 


" NWVCXXXKXQWXCNSY 


34 


Var7 


XWXCNLFKNQWFCXXX 

_ . " 
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The oligonucleotides encoding the peptides Varl through Var7 were mixed in equal 
proportion and a secondary library CLib2"), allowing about 1.7 x 10 7 different 
sequences, was constructed. More than 5 x 10* transformants were obtained, so that all 
allowed sequences should be present. 

rjpjr i.paii^T T ih? and Iso1ates from Lm 

L,b2 was panned through 3 rounds using lower amounts of target. After the third 
round 4 x 96 isolates were test by EL1SA for binding to NA3. Seventy-two isolates were 
ELISA positive and were sequenced. There were 71 distinct sequences, indicating that 
further screening may be needed to identify the very best sequences. The sequences are 
shown in Table 5, below. The seventy-one CEA binders of the Lib2 screen define a 
preferred family of binding moieties having a general formula: 

Xi Tr P -X, C:vs-X,.X, X,-X,X. .X.-Trp X, Cys X..X, X i: (SEQ ID NO:36), 
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wherein: 

X is Asp, Asn, Ala, or lie. with Asp most preferred; 

x! is Val, lie, Met, Tyr, Phe, Pro, or Asp, with Val most preferred; 

X is Asn, Glu, or Asp, with Asn and Glu most preferred; 

X is Leu, Phe, Tyr, Trp, Val, Met, He, or Asn. with Leu most prelerred; 

preferred; 

X is Lys, Phe, Asp, Gly, Leu, Asn, or Trp, with Lys most preferred; 
x', is Asn, Pro, Phe. Gly, Asp, A.a, Ser, Glu, Gin, or Trp, with Asn most 

10 preferred; 

X is Gin, or Lys, with Gin most preferred; 

' * ' . . » Vq , Tie Pro Tro or Tyr, with Phe most 

X, is Phe, Thr, Met, Ser, Ala, Asn, Val, His, lie, Pro, rp, o y 

preferred; 

X„ ,s Asn, Asp, Glu, Pro, Gin, or Ser, with Asn and Asp most preferred; 

X, , is Va., Leu, ..e. Pro, A.a, Gin, Ser, Met, Glu, Thr, Lys, or Trp, w,,h Va. and 

Leu most preferred; and 

X„ is Leu, Me,, Val, Tyr, Ala, lie, Trp, His, Pro. Gin, Glu, Phe, Lys, or Arg, w„h 

Leu most preferred. 
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Several isolates gave especially strong EL.SA signals and these were tested 
further. Figure 2 shows the abiltty of TN.0/9-G08. TNI0/9-A07. 304A-15-E04, 304A- 
,2 HI2 304A-14-B02, and 304A-14-A12 to bind to varying amounts of NA3. The 
binding of G08 and A07 drops off very ,u,c k ly as the amount of NA3 drops below 50 ng 
5 per well The binding of 304A-12-H.2 and 304A-14-A.2 do no. drop off so qu.cKlv. 
The 304A-14-B02 and 304A-15-E04 show binding that is a. leas, as good as G08 and 
A07 

' F , g „re 3 shows .he abtlilty of TN10/9-G08, TN10/9-A07, 304A-1 5-E04, 304A- 
,2 H , 2 304A-1 4-B02, and 304A-14-A, 2 display phage .0 bud N A3 in ,he presence of a 
,n solubie peptide having the G08 sequence (inhibitor peptide, designated DX-207). Ftgure 
3 shows that phage dispiaying peptides A07 and G08 are unable to bind ,„ the presence of 
2 uM mhibrtor pepude, wb„e 304A.,2- H1 2, 304A-.4-B02, and 304A-H-A.2 show 
subs.antta, btndmg in the presence of tnhtbitor. Thts mdica.es .ha. these pep.rdes have 
higher affimty for CEA domain A3 than do G08 or A07. 

Although a number of embodiments and features have been described above, it 
will be understood by those skilled in the art .ha, modifications and varia.ions of .he 
described embodiments and features may be made wi.hou, departing from ,he presen. 
disclosure or the invention as defined in .he appended claims. The publicattons et.ed 
20 herein are incorporated by reference. 
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